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In the late 1940s Richard Hamming recognized that the further evolution 
of computers required greater reliability, in particular the ability to detect and 
correct errors. (At the time, parity checking was being used to detect errors, 
but was unable to correct any errors.) He created the, Hamming Codes, perfect 
l-error correcting codes, and the extended Hamming Codes, 1-error correcting 
and 2-error detecting codes. Hamming Codes are still widely used in computing, 
telecommunication, and other applications including data compression, popular 
puzzles, and turbo codes. 


Definition A code, C is a subset of A”, for some set or alphabet A. 


Definition A code with a field as alphabet and which is a linear space over the 
field is a linear code. 


Definition A word is an element of A”. A codeword is an element of C. 


Here the alphabets will be finite fields. Linear codes with length n and 
dimension k will be described as [n,k] codes. Hamming Codes are linear codes, 
and a Hamming Code will be described as a [n,k] q-ary Hamming Code, where q 
is the size of the base field, F}. In other words an [n,k] q-ary Hamming Code is 
a linear subspace of the n-dimensional vector space over Fy. As an introduction, 
here is a concrete construction of a [7,4] binary Hamming Code. 

Begin with 4 ordered bits (base field F2) of information, and rename these 
as £3, £5, £6, £7, respectively. Choose 


£4 = T5 T Te T T7 (€ Fə) 





TQ = T3 T LE x7 ( F>) 











Ly = T3 £5 T7 ( Fə). 


Our Hamming codeword is then (x1 £2 £3 £4 £5 £e £7) So, for instance, if we 
started with (1 0 1 0) we would get, step-by-step, 
(a1 £2 124011) 
(x1 T2 1001 1) 
(z1110011) 


(0110011). 
Now, for the error correction we do the following. Let 


Qa = £4 T £5 T Te T T7 


b = T2 T T3 T Te T T7 











c = T1 + T3 + T5 + 17, 


and suppose that one error occurred during transmission of the codeword 
(£1 £2 £3 £4 £5 £e £7). Then abc, interpreted as a binary number, will give the 
subscript of the bit that is in error. For instance suppose we sent (0110011) 
and (0 1 1011 1) was received. In this case we compute a = 1, b= 0, c= 1, 
and abc=101, which is the binary representation of 5 so we know that the 5*” 
bit is wrong. (If there is no error then abc = 000 indicates no error occurred.) 
For a more general construction of [n,k] binary codes we need the definitions 
of generator and check matrices. 


Definition A generator matrix G for an [n,k] linear code C (over any field F,) 
is a k-by-n matrix for which the row space is the given code. In other words 
C = {xG xe FF}. 


Definition The dual of a code C is the orthogonal complement, C+. 


Definition A check matriz for an [n, k] linear code is a generator matrix for 
the dual code. 


If C is an [n,k] linear code, the dual to it is an [n, n-k] linear code. If M 
is the check matrix for C, M is an (n — k) x k matrix the rows of which are 
orthogonal to C and {x | Mx! = O}=C. 

Our general construction of a binary Hamming Code is actually a construc- 
tion of a matrix, from which we’ll define the Hamming Code as the linear code 
for which this matrix is the check matrix. For a given positive integer r form 
an r x (2" — 1) matrix M by making the columns the binary representations 
of 1,...,2" — 1 (not necessary in increasing order, though this is often most 
convenient). Now {x | Mx! = 0} is a linear code of dimension 2” — 1 — r, which 
we define to be a [2" — 1,2” — 1 — r] binary Hamming Code. Call {x | Mx! = 0} 
a [2" — 1, 2" — 1 — r] binary Hamming Code. 

The [7,4] binary Hamming Code (r = 3) first introduced has check matrix 


0 
0 
1 


O =. © 


0 111 
10) 20%. ae IP 
10101 


oo me 


the code is 


C={(0000000), (20-00 04.) (0100101), (0001111 
0011001), (0010011), (0010110), (0110011 
0110111), (0101111),(1000011),(1011010 
(1110000), (1001100), (1100110), (1111111) 


The construction also produces, for r = 3, the check matrix 


1 0 0 0 1 1 1 
0101011], 
0 0 1 1 10 1 

which gives a different set of Hamming codewords, and thus a different [7,4] 


binary Hamming Code. The word (1 000 11 1) is in this latter code, but does 
not appear in the list for the former. 


Definition The Hamming distance dy between any two words of the same 
length is defined as the number of coordinates in which they differ. 


Any mention of distance herein refers to Hamming distance. 
Proposition The minimum distance between binary Hamming codewords is 3. 


Proof: Suppose x and y are two codewords from a Hamming Code, C with 
check matrix M. Then x — y € C, since C is a linear code. If dy(x,y) = 1, 
then M(x-y) is a column of M. All columns of M are nonzero, but if (x-y) 
is a Hamming codeword, then M(x-y) = 0. Contradiction. If dy(x,y) = 2 
then M(x-y) = 0 iff there are two columns of M which are linearly dependent. 
This is not the case, hence dy(x,y) > 3 for all codewords x, y. Every check 
matrix for a binary Hamming Code will have three columns that are linearly 
dependent, so in fact some codewords are of distance 3. 


It is easy to see the Hamming distance is a metric. Then any word within 
distance 1 to a codeword is, in fact, within distance 1 to a unique codeword. 
Thus if any Hamming codeword is transmitted and at most 1 error occurs then 
the original codeword is the unique codeword within distance 1 to the received 
word. Thus it is true that the minimum distance between any two Hamming 
codewords is > 3, then it is true that Hamming Codes are 1-error correcting. 
Decoding any received word to this nearest Hamming codeword corrects any 
single error. 

In the general case error correction is even easier than in the introductory 
concrete construction. Again, we are assuming no more than one error during 
transmission. The method is called syndrome decoding. In the case of binary 
Hamming Codes syndrome decoding takes the following form. Both the sender 
and receiver of a word have agreed on an [n,k] binary Hamming Code with check 
matrix M. Upon receiving a word y, the receiver will compute My", which will 
be a binary r-tuple and thus be either the zero r-tuple, in which case there is 


no error, or be identical to one column of M (since M cointains every possible 
nonzero binaryr-tuple as a column), say m;. This tells the receiver that an error 
has occurred in the i*” coordinate of y. 

Consider the example from the concrete construction above where we sent 
(0110011) and (011011 1) was received. The syndrome is 


0 
1 
0001111 1 1 
0110011 ol=ļ]o0], 
1010101 1 1 
1 
1 


whic matches the 5“ column of the check matrix. We take this to mean that 
the 5” bit of the received word is in error. 

Hamming Codes run into problems if more than one error has occurred. If 
in the last example the word (1 1 1 0 1 1 1) is received (errors in places 1 and 
5) then the syndrome will be 


1 
1 
0001111 1 1 
01100 11 o/=|o], 
1010101 1 0 
1 
1 


telling us to switch the 4!” bit, which will give the word (0111111). Thus 
more than one error can not be corrected, nor even can we know that more 
than one error has occurred. The extended Hamming Codes are slightly more 
powerful, able to detect when 2 errors have occurred as well as able to correct 
any single error. 

Returning to the introductory construction of a [7,4] binary Hamming Code, 
we include a new ” parity check” bit, xo, with 


£o = T1 + T2 + T3 + £4 + £5 + T6 + L7, 


so that all eight digits sum to 0. The code now has length 8 and is still a linear 
code of dimension 4. We call this code an [8,4] extended binary Hamming Code. 
The construction of an extended binary Hamming Code which corrects one error 
and detects two follows the same procedure for any length n: just add a parity 
check bit. 

Check matrices can easily be constructed for the extended binary Hamming 
Codes from the check matrix for a Hamming Code: add a zero column on the 
left, then a row of all 1’s on the bottom. For example the Hamming Code with 


check matrix 





0001 1 1 1 
0 11 100 1 1j, 
1 O 1 0O 1 0 1 
becomes the extended Hamming Code with check matrix 
010 0 O 1 1 1 1 
0;0 11001 1 
0/1 010 1 0 1 
1 | o a a a 


Suppose that x and y are binary Hamming codewords of distance 3. Then 
one of x or y has even parity and the other odd, say x has even parity. If 
x’ and y’ are the extended Hamming codewords obtained from adding a check 
digit. Then zp = 0 since x has even parity and yo = 1 since y has odd parity. 
The distance between x’ and y’ is one more than the distance between x y, so 
the minimum distance between codewords of an extended Hamming Code is 4. 
Now any received word with one error is distance 1 from a unique codeword, 
and a received word with 2 errors is not within distance 1 from any codeword. 
If a word is within distance 1 to a codeword then we decode the word to that 
codeword as before. If a word is not within distance 1 to any codeword, then we 
recognize that 2 errors have occurred and report that two errors have occurred. 

Decoding with an extended Hamming Code is a little more complicated. Let 
M be a check matrix for an extended Hamming Code, let M’ be the (regular) 
Hamming Code matrix from which is was derived, and let y = (yo Yı --- Yn) be 
a received word. Suppose only one error has occurred, and suppose first that it 
has occurred in the last n bits. Computing the first n rows of the syndrome My! 
is the same as computing the syndrome M'(y; ... yn)’. This last syndrome 
will be nonzero (since there is an error in one of these bits). The last row of the 
syndrome My' will be 1, since the parity is off with only one error. Thus the 
syndrome matches a column of M. 

Suppose now that only the parity bit is in error. Then M’(y, ... yn)! will 
be zero, so the first n columns of My' will be zero, but the last column will be 
1 since the parity is off again, thus My! will match the first column of M. 

Thus as long as the syndrome matches a column of the check matrix then we 
can assume 1 error has occurred and switch the bit of the word corresponding 
to that column. 

Now suppose 2 errors have occurred. Wherever they occur the parity of the 
entire word will be correct, thus the syndrome will have a 0 in the last row 
and will not be a column of the check matrix. The syndrome will not be zero 
either since codewords of the extended Hamming Code are distance at least 4 
apart. Thus a nonzero, non-column syndrome indicates 2 errors (assuming that 
no more than two errors occurred). 

We can generalize our construction of binary Hamming Codes to q-ary Ham- 
ming Codes, where an [n,k] Hamming Code is now a linear space over a field of 
order q, prime. For a given r, choose a nonzero r-tuple of elements of F}. Choose 


another r-tuple from Fy linearly independent from the first. Make this column 
2. Continue choosing r-tuples such that any two are linearly independent until it 
is no longer possible to do so. Arrange these r-tuples as the columns of a matrix 
M. There are q” — 1 possible nonzero r-tuples, and each choice eliminates its (q- 
1) nonzero multiples from further consideration. Thus the number of columns 
of our matrix is (g” — 1)/(q — 1). Define the (linear) code for which this is a 
check matrix to be a [(q” —1)/(q— 1), (q —1)/(¢-—1) — 17] q-ary Hamming Code. 
Recall that a check matrix is the generator matrix of the dual code, which in 
this case must have dimension r, hence the dimension of our Hamming Code 
must be n — r = (q — 1)/(q — 1) - r. 
A [4,2] ternary (3-ary) Hamming Code can be given by the check matrix 


1 1 2 0 
0 1 1 1 





which has Hamming Code 


{(0000), (0111), (0222), 
1012), (2021),(1120), 
2210), (1201) (2102) } 


The same Hamming Code is produced with the check matrix 
0 1 1 1 
1 0 12’ 

A different [4,2] Hamming Code results from the check matrix 
0 2 1 2 
1 0 1 1j’ 

Hamming Code: 


{(0000), (0121), (0212), 
1011), (2022), (1102), 
(2201), (1220) (2110) }. 


These q-ary codes are again 1-error correcting, relying on that fact that each 
codeword is at a distance of at least 3 from any other codeword, which in turn 
relies on the construction of the check matrix. Specifically, the fact that no two 
columns of the check matrix are linearly dependent means that the minimum 
distance between any two codewords is at least 3. 

Syndrome decoding (of at most one error) easily extends to the q-ary case. 
Again we compute the syndrome My" for check matrix M and received word 
y. If the syndrome is the zero vector then we assume no error. If the syndrome 
is nonzero then it is a multiple of some column. If the syndrome is am, for 
aeF, and m; the it column of M, take this to mean that an error vector was 
added to the intended codeword during transmission, where the error vector 


is (0... 0a@0 ... 0), with a in the it spot. Then we recover the intended 
codeword asy—(0...0a0... 0). 


Definition A sphere of radius p centered at x is 


Sp(x) = {y € F;'| du(x,y) < p}. 


For any x € FY’ , |So(x)| = 1, that 1 being just x. There are (q-1) ways 
to change any coordinate of x and get a new word, and n possible coordinates 


to change. Hence |S1(x)| = 1+ n(q — 1). There are a ways to choose 
two coordinates to change, and (q — 1)? ways to change the two, so |S2(x)| = 


1l+n(q-1)+ (5) (q—1)°. In general, 


From discussion above, a code is e-error correcting if Se(x) N Sely) = 0 
whenever x # y. Thus, if C is an e-error correcting code, |C] - |S-(x)| < |Fo|”, 
(for any x since the cardinality of the sphere does not depend on the choice of 
x). This inequality is called the sphere-packing bound. 


Definition An e-error correcting code is called a perfect e-error correcting code 
if |C|- |Se(a)| < Fal”, or if 


icl= LD3 (")(a-1) 


Proposition Hamming Codes are perfect 1-error correcting codes. 


Proof: We need to check that 
: n a n 
cl 32 (")@- D = FI”. 
i=0 


The right hand side of this is q”, where n = (q” — 1)/(q— 1). The left hand side 
is 





ratna) = e (14E 2q-y) 
= (+r -1) 
= eg) 


Thus Hamming Codes are perfect l-error correcting codes. 
Definition A e-covering code is a code C C A” for which A” = U{ Se(x| x € C}. 
Proposition If C is an e-covering code then |C|-|S.(x)| > |F”. 


Definition If equality holds in the last proposition then C is called a perfect 
e-covering code. 


Of course the perfect e-error correcting codes and the perfect e-covering 
codes coincide. It is known that Hamming Codes are the only perfect 1-error 
correcting codes. 


